NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Doing Experiments and Revising Rules With Natural Language and Probabilistic Reasoning

Piriyakulkij, Top Wasu; Langenfeld, Cassidy; Le, Tuan-Anh; Ellis, Kevin (December 2025, NeurIPS)

Free, publicly-accessible full text available December 10, 2026
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students’ Formative Assessment Responses in Science

https://doi.org/10.1609/aaai.v38i21.30364

Cohn, Clayton; Hutchins, Nicole; Le, Tuan; Biswas, Gautam (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

This paper explores the use of large language models (LLMs) to score and explain short-answer assessments in K-12 science. While existing methods can score more structured math and computer science assessments, they often do not provide explanations for the scores. Our study focuses on employing GPT-4 for automated assessment in middle school Earth Science, combining few-shot and active learning with chain-of-thought reasoning. Using a human-in-the-loop approach, we successfully score and provide meaningful explanations for formative assessment responses. A systematic analysis of our method's pros and cons sheds light on the potential for human-in-the-loop techniques to enhance automated grading for open-ended science assessments.
more » « less
Full Text Available
Utilizing Textual Reviews for Visualizing and Understanding User Preferences

https://doi.org/10.1145/3625007.3627486

Pham, Dang; Le, Tuan (November 2023, ACM)

Full Text Available
Focused Stochastic Neighbor Embedding for Better Preserving Points of Interest

https://doi.org/10.1109/BDCAT56447.2022.00043

Ramirez, Rafael Baez; Kumar, Sanuj; Le, Tuan M.; Cao, Huiping (December 2022, 2022 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT))

Full Text Available
A Word Embedding Topic Model for Robust Inference of Topics and Visualization

https://doi.org/10.1145/3486001.3486002

Kumar, Sanuj; Le, Tuan (October 2021, The First International Conference on AI-ML-Systems)

Full Text Available
Focused Stochastic Neighbor Embedding for Better Preserving Points of Interest

Ramirez, Rafael Baez; Kumar, Sanuj; Le, Tuan M.; Cao, Huiping (January 2022, Proceedings of the IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT2022))

Full Text Available
Advances in Knowledge Discovery and Data Mining

Varela, Edgar Ceh; Cao, Huiping; Le, Tuan (May 2021, Lecture notes in computer science)

Full Text Available
Neural Topic Models for Hierarchical Topic Detection and Visualization

https://doi.org/10.1007/978-3-030-86523-8_3

Pham, Dang; Le, Tuan (January 2021, Joint European Conference on Machine Learning and Knowledge Discovery in Databases)

Full Text Available
Auto-Encoding Variational Bayes for Inferring Topics and Visualization

Dang, Pham; Le, Tuan (December 2020, Proceedings of the 28th International Conference on Computational Linguistics)
null (Ed.)
Visualization and topic modeling are widely used approaches for text analysis. Traditional visualization methods find low-dimensional representations of documents in the visualization space (typically 2D or 3D) that can be displayed using a scatterplot. In contrast, topic modeling aims to discover topics from text, but for visualization, one needs to perform a post-hoc embedding using dimensionality reduction methods. Recent approaches propose using a generative model to jointly find topics and visualization, allowing the semantics to be infused in the visualization space for a meaningful interpretation. A major challenge that prevents these methods from being used practically is the scalability of their inference algorithms. We present, to the best of our knowledge, the first fast Auto-Encoding Variational Bayes based inference method for jointly inferring topics and visualization. Since our method is black box, it can handle model changes efficiently with little mathematical rederivation effort. We demonstrate the efficiency and effectiveness of our method on real-world large datasets and compare it with existing baselines.
more » « less
Full Text Available
Multi-criteria and Review-Based Overall Rating Prediction

https://doi.org/10.1007/978-3-030-75765-6_38

Ceh-Varela, Edgar; Cao, Huiping; Le, Tuan (January 2021, Pacific-Asia Conference on Knowledge Discovery and Data Mining)

An overall rating cannot reveal the details of user’s preferences toward each feature of a product. One widespread practice of e-commerce websites is to provide ratings on predefined aspects of the product and user-generated reviews. Most recent multi-criteria works employ aspect preferences of users or user reviews to understand the opinions and behavior of users. However, these works fail to learn how users correlate these information sources when users express their opinion about an item. In this work, we present Multi-task & Multi-Criteria Review-based Rating (MMCRR), a framework to predict the overall ratings of items by learning how users represent their preferences when using multi-criteria ratings and text reviews. We conduct extensive experiments with three real-life datasets and six baseline models. The results show that MMCRR can reduce prediction errors while learning features better from the data.
more » « less
Full Text Available

« Prev Next »

Search for: All records